A Generative Model of Indoor Scenes
نویسنده
چکیده
In this Master thesis project, we tackle the problem of 3D indoor scene understanding from a single image. Following existing approaches, we parameterize the problem as one of estimating the faces of a 3D cuboid. Towards this goal, we propose a generative model of the scene, which takes advantage of sophisticated image features while utilizing a simple prior learned from the statistics of room photographs. We utilize an adaptive Metropolis-Hastings sampling scheme for inference and demonstrate that our approach outperforms the state-of-the-art on the main two benchmarks that exist for this task. Moreover, our approach outperforms previously published generative models for this problem by more than 10% absolute error.
منابع مشابه
Top-Down Bayesian Inference of Indoor Scenes
The task of inferring the 3D layout of indoor scenes from images has seen many recent advancements. Understanding the basic 3D geometry of these environments is important for higher level applications, such as object recognition and robot navigation. In this chapter, we present our Bayesian generative model for understanding indoor environments. We model the 3D geometry of a room and the object...
متن کاملGenerating Multi-Sentence Lingual Descriptions of Indoor Scenes
This paper proposes a novel framework for generating lingual descriptions of indoor scenes. Whereas substantial efforts have been made to tackle this problem, previous approaches focusing primarily on generating a single sentence for each image, which is not sufficient for describing complex scenes. We attempt to go beyond this, by generating coherent descriptions with multiple sentences. Our a...
متن کاملGenerating Multi-sentence Natural Language Descriptions of Indoor Scenes
This paper proposes a novel framework for generating lingual descriptions of indoor scenes. Whereas substantial efforts have been made to tackle this problem, previous approaches focusing primarily on generating a single sentence for each image, which is not sufficient for describing complex scenes. We attempt to go beyond this, by generating coherent descriptions with multiple sentences. Our a...
متن کاملA Generative Model of 3D Object Layouts in Apartments
Understanding indoor scenes is an important task in computer vision. This task is typically ambiguous, so we require a strong prior, that captures the regularity of indoor environments. This is naturally expressed by a probabilistic model over 3D room layouts and geometry, reasoning over complex layouts in 3D space, including high-order spatial relations among objects. In this work, we construc...
متن کاملIndoor Scene Recognition Through Object Detection Using Adaptive Objects Search
Scene recognition is a highly valuable perceptual ability for an indoor mobile robot, however, current approaches for scene recognition present a significant drop in performance for the case of indoor scenes. In our recent work [1], we argued that this can be explained by the high appearance variability of indoor environments, stressing the need to include high-level semantic information in the...
متن کامل